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Abstract 

Endo-p-1, 4-rriannanase from Thermotoga petrophila (TpMan) is a hyperthermostable enzyme that catalyzes the hydrolysis of 
P-1,4-mannoside linkages in various mannan-containing polysaccharides. A recent study reported that TpMan is composed 
of a GH5 catalytic domain joined by a linker to a carbohydrate-binding domain. However, at this moment, there is no three- 
dimensional structure determined for TpMan. Little is known about the conformation of the TpMan as well as the role of the 
length and flexibility of the linker on the spatial arrangement of the constitutive domains. In this study, we report the first 
structural characterization of the entire TpMan by small-angle X-ray scattering combined with the three-dimensional 
structures of the individual domains in order to shed light on the low-resolution model, overall dimensions, and flexibility of 
this modular enzyme at different temperatures. The results are consistent with a linker with a compact structure and that 
occupies a small volume with respect to its large number of amino acids. Furthermore, at 20°C the results are consistent 
with a model where TpMan is a molecule composed of three distinct domains and that presents some level of molecular 
flexibility in solution. Even though the full enzyme has some degree of molecular flexibility, there might be a preferable 
conformation, which could be described by the rigid-body modeling procedure. Finally, the results indicate that TpMan 
undergoes a temperature-driven transition between conformational states without a significant disruption of its secondary 
structure. Our results suggest that the linker can optimize the geometry between the other two domains with respect to the 
substrate at high temperatures. These studies should provide a useful basis for future biophysical studies of entire TpMan. 
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Introduction 

Lignocellulosic biomass is the major renewable carbon source in 
nature with several applications and consists of cellulose, 
hemicellulose and lignin with an approximate representation of 
2:1:1 [1-3]. Cellulose consists of linear polysaccharides of (3-1,4- 
linked D-glucose residues, whereas lignin is a phenolic macromol- 
ecule. Hemicelluloses are linear or branched heteropolysaccha- 
rides derived from sugars such as D-xylose, D-galactose, D- 
mannose, D-glucose, and L-arabinose [4]. They are classified 
according to the main sugar unit as xylans, galactans and 
mannans, and most of the main-chain sugars on hemicellulose 
polymer are linked together by (3- 1 ,4-glycosidic bonds [4-6]. 
Xylans comprise the major hemicellulose component present in 
hardwoods, whereas mannans are more prominent in softwoods 
[5,7]. Mannans are classified into two principal groups depending 
on whether the P-l,4-linked backbone contains only D-mannose 
(mannans) or a combination of D-mannose and D-glucose residues 



(glucomannans). Linear mannan or glucomannan chains contain- 
ing more than 5% D-galactose are called galactomannans and 
galactoglucomannans, respectively [7]. 

The biodegradation of cellulose and hemicellulose polymers 
involves the concerted action of a variety of hydrolytic enzymes. 
Cellulases and xylanases have been extensively studied due to their 
several industrial applications principally relating to biorefmeries, 
however, mannan-degrading enzymes have enjoyed less attention. 
The major enzymes involved in the biodegradation of mannan 
polysaccharides are (3-mannanase (EC 3.2.178), (3-mannosidase 
(EC 3.2.1.25), and P-glucosidase (EC 3.2.1.21) [7,8]. The enzyme 
P-mannanase is responsible for the cleavage of P-l,4-linked 
internal linkages of the mannan polymer to produce new chain 
ends, whereas P-mannosidase cleaves P-l,4-linked mannosides 
releasing mannose from the nonreducing end of mannans and 
mannooligosaccharides [7,8]. The enzyme P-glucosidase hydro- 
lyzes 1,4-P-D-glucopyranose at the nonreducing end of the 
oligosaccharides released from glucomannan and galactogluco- 
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mannans by (3-mannanase. Additional enzymes such as acetyl 
mannan esterase (EC 3.1.1.6) and oc-galactosidase (EC 3.2.1.22) 
are required to remove side groups that might be attached at 
several points on the mannan polymer, creating more sites for 
subsequent enzyme hydrolysis [7,8]. The biodegradation of 
mannan represents a key step for various industrial applications 
including delignification of kraft pulps, food processing and 
production of second-generation biofuels [6—11]. 

Thermotoga petrophila (T. petrophila) strain RKU-1 (T) is a 
hyperthermophilic bacterium isolated from the Kubiki oil 
reservoir in Niigata Japan) [12]. The temperature range for 
growth is 47-88°C with an optimum at 80°C. The pH range for 
growth is 5-9 with the optimum at pH 7. This bacterium produces 
a repertoire of hyperthermostable enzymes of great potential for 
industrial applications, including cellulases, arabinofuranosidases, 
arabinanases and mannanases, and has proved to be a suitable 
source of enzymes for biotechnological applications and protein 
engineering of glycoside hydrolases [13-15]. For various industrial 
applications, it is essential that mannanase possesses high activity 
and thermostability [11]. 

The hyperthermostable bacterial endo-(3-l,4-mannanase (EC 
3.2.1.78) from T. petrophila (TpMan) is an enzyme composed of 667 
amino acid residues (Gene Bank: ABQ47550.1). The first 20 
amino acid residues are a signal peptide, which are removed after 
protein exportation. TpMan enzyme presents optimal activity at 
the temperature range of 81-93°C [13]. Both TpMan and its 
catalytic domain present maximal activity at the acidic pH range 
of 4.5-6.5. The truncated catalytic domain had optimal activity at 
a lower temperature range when compared to the TpMan enzyme 
[13]. A recent study reported that TpMan consists of a GH5 
catalytic domain (TpMan GH5, 373 amino acid residues) and a 
carbohydrate-binding domain (TpManCBM27, 172 amino acid 
residues) connected through a linker (102 amino acid residues) 
[13]. Furthermore, the same study presented the crystal structure 
of GH5 catalytic domain, however, at this moment, there is no 
three-dimensional structure available for entire TpMan. 

In the present study, the bacterial TpMan was analyzed using 
biophysical techniques as well as with bioinformatics tools. Here, 
we report the first structural characterization of hyperthermostable 
endo-(3-l,4-mannanase from the bacterium T. petrophila by small- 
angle X-ray scattering in order to shed light on the low-resolution 
molecular envelope, overall dimensions, and flexibility of this 
modular enzyme at different temperatures. 

Results 

Temperature Profile of TpMan Enzymatic Activity 

The activity of an enzyme is dependent on pH and temperature, 
conditions that invoke changes in enzyme folding, conformation, 
and protonation. As a first step of our studies, the mannan endo- 
l,4-(3-mannosidase activity of TpMan and TpManGH5 were 
studied as a function of temperature using locust bean gum 
(galactomannan) as substrate (Fig. 1). Our results showed that, 
under the conditions of our study (at pH 6), the enzymatic activity 
of TpMan increases as temperature increases. A similar behavior 
was also observed in the case of TpMan GH5. However, 
TpManGH5 showed a significant decline in the enzymatic activity 
at 85 °C when compared to TpMan (Fig. 1). Thus, our results 
indicate that TpMan is more thermotolerant than the truncated 
catalytic domain. The results presented here are consistent with 
the results published recently for TpMan [13]. 
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Figure 1. Enzymatic activity. Effect of temperature on the enzymatic 
activity of TpMan and TpManGH5. 
doi:10.1371/journal.pone.0092996.g001 

Dynamic Light Scattering (DLS) 

Dynamic light scattering (DLS) provides a fast and convenient 
way to monitor the size of proteins in solution and also investigate 
whether the enzyme aggregates at high temperatures and/or at 
high protein concentrations [16]. DLS size distribution profiles for 
TpMan and TpManGH5 are available in supporting information 
(Figs. SI and S2). As can be seen, the hydrodynamic radii (Rg) of 
TpMan were practically independent of protein concentration 
over the range 0.5 to 8 mg/mL at 20°C and pH 6 (Fig. 2A). 




Concentration (mg/mL) 




Temperature (°C) 

Figure 2. Characterization of TpMan by DLS at pH 6. (A) The 

hydrodynamic radius (ft s ) of TpMan (at 20°C) as a function of the 
protein concentration. (B) The hydrodynamic radius (ft s ) of TpMan (at 
8 mg/mL) as a function of temperature. 
doi:10.1371/journal.pone.0092996.g002 
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Furthermore, the R$ of TpMan exhibited minimal temperature 
dependence over the range 20 to 85°C at pH 6 and 0.5 mg/mL 
(Fig. 3A). After incubation at 85°C (at 0.5 mg/mL) there was no 
evidence of enzyme precipitation and the solution remained 
transparent. The average value of R$ determined for TpMan by 
the DLS method was 36 ±2 A, which corresponds to a molecular 
mass of 68 ±6 kDa, consistent with the expected molecular mass of 
73 kDa. However, when TpMan at pH 6 and 8 mg/mL was 
incubated at temperature values above 65 °C the R s increased 
significantly, suggesting a tendency to form amorphous aggregates 
(Fig. 2B) [16]. After incubation at 85°C (at 8 mg/mL) the initially 
clear TpMan solution was very turbid and precipitate had formed. 

In the case of TpMan GH5, the R s were practically independent 
of temperature over the range 20 to 75°C at pH 6 and 0.5 mg/ 
mL (Fig. 3B). The average Rs determined for TpManGH5 by the 
DLS method was 28 ±2 A, which corresponds to a molecular mass 
of 38 ±5 kDa, entirely consistent with the expected molecular mass 
of 42 kDa. However, when TpManGH5 at pH 6 and 0.5 mg/mL 
was incubated at temperature values above 75 °C the R s increased 
significantly, suggesting a tendency to form amorphous aggregates 
(Fig. 3B). Again, after incubation at 85°C (at 0.5 mg/mL) the 
initially clear TpManGH5 solution was very turbid and precipitate 
had formed. 

Small-Angle X-ray Scattering Data Analysis and Low 
Resolution Modeling 

To obtain more detailed information about the tertiary 
structure of TpMan and its molecular shape, we submitted 
TpMan to small-angle X-ray scattering (SAXS) analysis as a 
function of temperature. The X-ray scattering curves obtained at 
8 mg/mL and three different temperatures (T = 20, 50 and 65°C) 
for TpMan at pH 6 are shown in Fig. 4. Scattering curves 
obtained at 80 °G and higher concentration showed nonnegligible 




q (A 1 ) 

Figure 4. SAXS data collected for TpMan at pH6 and different temperatures. Experimental SAXS curves of the TpMan at 20°C (open black 
circles with errors bars), 50°C (open black triangles with errors bars) and 65°C (open black squares with errors bars) superimposed on the computed 
scattering curves based on the restored low-resolution models (DAMs, solid black lines). The curves have been offset for clarity, 
doiil 0.1 371 /journal. pone.0092996.g004 
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Figure 3. Characterization of TpMan and TpManGH5 by DLS at 

pH 6. (A) The hydrodynamic radius {R s ) of TpMan (at 0.5 mg/mL) as a 
function of temperature. (B) The hydrodynamic radius (ft s ) of 
TpManGH5 (at 0.5 mg/mL) as a function of temperature, 
doiil 0.1 371 /journal, pone.0092996.g003 
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Figure 5. Guinier plots and experimental distance distribution 
functions. (A) Guinier plots (In / versus q 2 ) for TpMan at 20°C (open 
black circles), 50°C (open black triangles) and 65°C (open black squares). 
The curves have been offset for clarity. (B) Experimental P(r) of TpMan 
at 20°C (red solid line), 50°C (black solid line) and 65°C (blue solid line). 
The curves have been scaled to a maximum height of 1.0 for ready 
comparison. 

doi:10.1371/journal.pone.0092996.g005 



The low-resolution molecular shape of TpMan was determined 
from X-ray scattering data with DAMMIN package [18] to 
visualize the shape of the enzyme in solution at different 
temperatures. The resulting consensus envelopes for the temper- 
ature values studied are shown in Fig. 6. In each case, ten 
independent ab initio simulations were performed without imposing 
any symmetry restrictions, and the models described very well the 
experimental curves. The results, in each case, were compared 
with each other by the use of the DAMAVER procedure [19] and 
the most representative model for the whole set is shown in Fig. 6. 
The reconstructed low-resolution model of TpMan at 20°C and 
pH 6, restored at 18 A resolution, showed an elongated L-shaped 
molecule, with a D max of approximately 105 A. However, the 
reconstructed low-resolution model of TpMan at 65 °C and pH 6, 
restored at 18 A resolution, showed a more elongated shape 
roughly twice as long as they are wide, with a D max of 
approximately 1 1 5 A. The atomic coordinates of the low- 
resolution models of TpMan are available by request to the 
corresponding author of this article. A summary of the main 
SAXS results described in this study for TpMan is given in Table 2. 

Kratky Plot and Flexibility 

Information about the global compactness of TpMan can be 
obtained from the SAXS data by the use of the so called Kratky 
plot (q 2 .l vs. q) [20,21]. As can be seen in Fig. 7A, the curve for 
TpMan at 20 °C (at pH 6) shows a well-defined maximum (q < 
0.15 A l ), however, it does not decay close to zero at higher q 
values, presenting a slight elevated baseline. This result suggests 
that TpMan may exhibit some degree of flexibility at 20°C and 
pH 6, probably owing to the inherent flexibility between domains. 
At 50°C and pH 6, the curve shows also a well-defined maximum 
and presents a subtle decrease of the baseline when compared to 
the same enzyme at 20°G (Fig. 7B). Interestingly, at 65°C and 
pH 6 (Fig. 7G) the curve for TpMan decay close to zero at higher 
q values, suggesting that the molecule is less flexible in solution at 
65°C when compared to the same enzyme at 20°G. 



interference effects, and they were not used for analyzes (data not 
shown). The Guinier plots of the data exhibited a linear behavior 
indicating excellent monodispersity of TpMan and that the 
particles are free of significant aggregation (Fig. 5A). Furthermore, 
the variation among the radii of gyration values for different 
protein concentrations and same temperature were insignificants 
(Table 1). The radii of gyration (R g ) evaluated with the Guinier 
approximation at 20, 50 and 65°C were 32±1 A, 34± 1 A and 
36 ±1 A, respectively. The pair distance distribution functions, 
P(r), evaluated by the indirect Fourier transform with GNOM 
package [17] are shown in Fig. 5B. The profiles obtained at 20, 50 
and 65 °C have very similar shapes and trail off to a maximum 
dimension (D max ) of 1 05 ± 5 A, 1 1 0 ± 5 A and 1 1 5 ± 5 A, respec- 
tively. The R g values determined for TpMan at 20, 50 and 65°C 
using the GNOM package were 32.50±0.14A, 34.87±0.15 A 
and 35.98±0.19 A, respectively. 



Molecular Modeling 

A recent study presents the crystal structure of GH5 catalytic 
domain [13], however, at this moment, there is no three- 
dimensional structure determined by experimental methods 
available for entire TpMan. In order to permit the modeling of 
the entire structure, the three-dimensional structure of the last 274 
amino acids residues from TpMan was modeled using bioinfor- 
matics tools. As can be seen from Fig. 8 and Table 3, both the 
threading methods used (HHPRED and PHYRE2) indicated two 
distinct domains. The C -terminal domain (TpManCBM27) 
presents high sequence identity and confidence, in both servers, 
when compared with CBM27 from Thermotoga maritima endo-(3- 
1,4-mannanase (TmMan) (Table 3). Therefore, TpManCBM27 
was directly modeled (Fig. 8) from its 88% sequence identity to 
CBM27 from TmMan whose three-dimensional structure was 
solved by X-ray crystallography [22]. However, the central 



Table 1. Radius of gyration {R g ) calculated with Guinier approximation at different protein concentrations. 





Cone. (mg/mL) 


R g (A) at 20 °C 


R g (A) at 50 °C 


R g {k) at 65 °C 


2 


32±1 


33±1 


36±1 


4 


33±1 


34±1 


35±1 


8 


32±1 


34±1 


36±1 



doi:1 0.1 371/journal.pone.0092996.t001 
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Figure 6. Low-resolution models. (A) Molecular envelope of TpMan in solution at 20°C obtained by DAMMIN package (dummy atoms). The 
center and right structures were rotated y axis-90°and x axis-90° in relation to the left structures. (B) Molecular envelope of TpMan in solution at 50°C 
obtained by DAMMIN package (dummy atoms). The center and right structures were rotated y axis-90°and x axis-90° in relation to the left structures. 
(C) Molecular envelope of TpMan in solution at 65°C obtained by DAMMIN package (dummy atoms). The center and right structures were rotated y 
axis-90°and x axis-90° in relation to the left structures. 
doi:10.1371/journal.pone.0092996.g006 



domain (linker) presents low sequence identity, in both servers, 
when compared with XI domain from Clostridium thermocellum 
cellobiohydrolase A (CbhA) [23]. Despite low values of sequence 
identity obtained in both servers, it is obvious the high level of 
confidence of the selected template (Table 3). Thus, the central 
domain was modeled from its 14% sequence identity and 97% 
confidence to XI domain from CbhA (Fig. 8). The molecular 
model obtained for the central domain is structurally very similar 
to immunoglobulin-like (3-sandwich (Ig-like) domain and it will be 
adopted the nomenclature TpManlg-like for this domain. 

Far-UV Circular Dichroism (CD) Spectroscopy 

Far-UV CD spectroscopy was used to probe the secondary 
structure of the expressed TpMan and TpManGH5 products to 
assess their folding in terms of secondary structure and to draw 
comparisons (Fig. 9). The CD spectrum of TpManGH5 collected 
at 20°C (pH 6) is characterized by a positive band at 195±1 nm 
and two negative bands at 210±1 nm and 220 ±1 nm, together 
with a negative to positive crossover at 203 ± 1 nm (Fig. 9B). These 
bands are typically found in structures with a large content of oc- 
helical secondary structure. This is different to that observed for 
the TpMan CD spectrum, which does not show two minima well- 
defined. The CD spectra for TpMan collected at pH 6 and three 
temperature values is shown in Fig. 9A. The data indicate that the 
secondary structure of the enzyme does not change significantly in 
response to increasing the temperature from 20 to 85 °C. 

The percentages of (X-helix, P-strand, and coils (turns and 
irregular structures) were obtained by deconvolution of the 
experimental spectra using four independent methods and 



expressed as a mean and standard deviation (see the Materials 
and Methods). In the case of TpManGH5, the deconvolution of 
the spectrum yielded the following average values for the 
secondary structure (Table 4): 42 ±4% (X-helix, 1 6 ± 2 % P-sheet, 
and 42 ±5% coil (turn plus random). The corresponding results 
obtained for TpMan were 21 ±4% a-helix, 28±3% P-sheet, and 
51 ±5% turns /irregulars structures (Table 4), indicating an 
increase in P-sheet and coil content and a decrease in cx-helix 
content in the entire enzyme. 

Computational Methods for Secondary-Structure 
Prediction 

The results of the secondary-structure prediction for the 
complete TpMan sequence (Gene Bank: ABQ47550.1) and its 
individual domains are shown in Table 4. The results indicate that 
overall TpMan is composed of roughly equal percentages of 
regular secondary structure (oc-helices and P-sheets) and random 
coil (turns and irregular structures). The regular secondary 
structure of the TpManGH5 is dominated by oc-helix. However, 
the regular secondary structure of the TpManCBM27 and 
TpManlg-like are dominated by P-sheet. 

Ensemble Optimization Method (EOM) Modeling 

To understand better the flexibility information suggests by the 
Kratky plot at different temperatures, ensemble optimization 
modeling (EOM) was applied [24] . The R g and D max distributions 
of the structures generated for the initial pool and the distributions 
of the structures selected during the ensemble optimization are 
shown for 20, 50 and 65°C in Fig. 10. The selected populations of 
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Table 2. General SAXS results from TpMan. 





Parameters/samples 


20°C 










Experimental 


DAM 


RBM 


EOM 




32±1 (Guinier) 
32.50±0.14 (GNOM) 


32.97 


32.30 


31.14 


Dmax (A) 


105±5 


103.90 


104.70 


102.70 


Resolution (A) 18 - 


111 2 




1 .6/2.6 


1 .8/3.2 


1.9/3.6 


50°C 




Experimental 


DAM 


RBM 


EOM 




34±1 (Guinier) 
34.87±0.15 (GNOM) 


35.26 


33.99 


33.22 


o max [k) 


110±5 


110.60 


111.20 


105.17 


Resolution (A) 18 - 



X// 2 - 1.1/1.2 1.3/1.7 2.4/5.8 



65°C 

Experimental DAM RBM EOM 

R g (k) 36±1 (Guinier) 36.41 34.95 34.31 

35.98±0.19 (GNOM) 

Dmax (A) 115±5 116.20 118.20 108.63 

Resolution (A) 18 - 

X/Z 2 - 1-3/1.7 1.5/2.3 2.1/4.4 



doi:1 0.1 371/journal.pone.0092996.t002 

structures became narrower when the temperature was raised 
from 20 to 65 °C, suggesting a decrease in the molecular flexibility 
with the increase in temperature. At 20°C the R g and D max of the 
structures selected during the ensemble optimization are centered 
around 31 A and 103 A, respectively (Table 2). At 50°C the R g 
and D max of the structures selected are centered around 33 A and 
105 A, respectively. Finally, at 65°C the R g and D max of the 
structures selected are centered around 34 A and 109 A, 
respectively. The EOM results indicate that TpMan becomes 
more elongated and less flexible when the temperature increases 
from 20 to 65 °C. The excellent fit profiles determined by the 
EOM modeling process are shown in Fig. 1 1 . The Fig. 1 2 shows 
the structure with highest frequency in the optimized population at 
20°C superposed to the cloud of models. It was considered the 
models that covered 80% of the frequency interval. 

Rigid-Body Modeling (RBM) 

The CORAL package [25] was used to perform rigid-body 
modeling of the three-dimensional structures based on our SAXS 
curves obtained at different temperatures. The three-dimensional 
structures of the TpMan GH5, TpManlg-like and TpManCBM27 
were organized and their relative positions and orientations were 
optimized by the use of a simulated annealing procedure. As a 
result, the three-dimensional arrangement of the domains that 
provides the best fit to SAXS experimental data is obtained. The 
excellent fits profiles determined by the rigid-body modeling are 
shown for 20, 50 and 65°C in Fig. 11. The rigid-body model 
(RBM) obtained at 20°G shows a particle with a D max of 104.70 A 
and 32.30 A. The RBM obtained at 50°C shows a particle 
with a D max of 111.20 A and ^=33.99 A. Finally, the RBM 



obtained at 65°C shows a particle with a D max of 1 18.20 A and 
R g — 34.95 A. Superposition of the low-resolution models obtained 
at different temperatures by the dummy atom modeling approach 
(DAM) and the respective TpMan rigid-body models (RBM) show 
a very good agreement (Fig. 6). Also, the superposition of the 
EOM highest frequency model and the RBM, both obtained at 
20°G and pH 6, shows also a good agreement (Fig. 13). 

Discussion 

Modular proteins in which globular domains are connected by 
linkers are common in nature [26]. It has been recognized that a 
large number of bacterial and fungal carbohydrases are composed 
of a catalytic domain and a carbohydrate-binding domain 
separated by an interdomain linker [27]. A recent study reported 
that TpMan has a modular structure with a catalytic domain and a 
carbohydrate-binding domain connected by a long linker (consid- 
ering its large number of amino acids) [13]. The same study 
presented the crystal structure of the catalytic domain at 1.5 A 
resolution, however, crystallization of the full-length TpMan 
proved to be unsuccessful, probably owing to the inherent 
flexibility between domains [13]. In the present study, entire 
TpMan was analyzed using biophysical techniques as well as with 
bioinformatics tools. The experiments reported here provided the 
global structural characterization of the entire TpMan and also 
the estimation of the influence of the linker and of temperature on 
the conformation and flexibility of this hyperthermostable endo-(3- 
1 ,4-mannanase. 
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Figure 7. Kratky plots. (A) Kratky plot of SAXS data for TpMan at 
20°C transformed as q 2 .l{q) versus q. (B) Kratky plot of SAXS data for 
TpMan at 50°C transformed as q 2 .l{q) versus q. (C) Kratky plot of SAXS 
data for TpMan at 65°C transformed as q 2 .l{q) versus q. 
doi:1 0.1 371 /journal, pone.0092996.g007 



Dimension and Shape of Hyperthermostable Bacterial 
TpMan at 20°C 

The value of the radius of gyration of TpMan (at 20 °C and 
pH 6) calculated from the Guinier plot (R g = 32 ± 1 A) is very close 
to the value obtained from the integral analysis of the scattering 
curve (R g = 32.50±0.14 A) using the method implemented in the 
GNOM package (Fig. 5). Furthermore, the reconstructed envelope 
of TpMan restores at 1 8 A resolution shows an elongated particle 
with a D max of 103.90 A and 7^= 32.97 A (Fig. 6). The 
hydrodynamic radius obtained by DLS (Rs= 36 ±2 A) is consistent 
with the Rs determined for the TpMan low-resolution model 
(7^=38.3 A) employed the program HYDROPRO [28]. The 
SAXS data are therefore consistent with a monomeric structure in 
the form of an elongated L-shaped particle. The values of the radii 
of gyration and of the D max of TpMan both indicate that the 
dimension of the linker is not very large with respect to its mass. 
Therefore, our results are consistent with a linker with a compact 
structure and that occupies a small volume with respect to its 
relative large number of amino acids. Although the resolution of 
the molecular envelope is limited, it represents the first visualiza- 
tion of a full-length TpMan structure reported to date. 

Molecular Model for the Linker 

The low-resolution model obtained by SAXS for the TpMan 
indicates that the linker has a compact structure. To investigate 
this hypothesis we performed additional studies employing 
threading methods and molecular modeling. The templates 
identified by the servers HHPRED and PHYRE2 are carbohy- 



drases from thermophilic bacteria (Table 3). The central domain 
(linker) presents low sequence identity when compared with the 
selected templates (Table 3). However, it is obvious the high level 
of confidence of the selected templates. TpManCBM27 was 
directly modeled using as template GBM27 from TmMan [22], 
which presents high sequence identity and confidence (Table 3). 
However, a consistent molecular model for TpManCBM27 could 
be also directly modeled using a template that presents low 
sequence identity but high confidence, as CBM27 from Caldicellu- 
losiruptor saccharolyticus . Similarly, it was possible to obtain a 
consistent molecular model for the central domain employing 
XI domain from CbhA [23] as template (Fig. 8). 

The central region (linker) of TpMan is a compact domain of 
unknown biological function [22]. The molecular model obtained 
for the central domain is structurally very similar to immunoglob- 
ulin-like (3-sandwich domain (Ig-like). There is convincing 
evidence that these structures are simply a "storage form" of the 
linker that connects other domains of the enzyme that can be 
extended as needed [23]. Alternatively, they may simply function 
as spacers between the others domains. In the case of XI modules 
from CbhA was proposed that a possible role of these modules is 
that of thermo structural stabilizers of the CbhA enzyme complex 
[23]. In the case of TpMan, the deletion of the linker plus 
TpManCBM27 reduced the thermal stability of the TpManGH5 
(Fig. 3), and enzymatic data reveal that the TpManGH5 alone 
losses significantly more catalytic activity than the full enzyme at 
elevated temperature (Fig. 1). Thus, a possible role of the central 
domain is to increase the thermal stability of the whole enzyme as 
proposed to CbhA complex [23]. 

Secondary Structure of Distinct Domains Within Bacterial 
TpMan 

Visual inspection of the CD spectrum of TpMan GH5 indicates 
the presence of cx-helical secondary structure as evidenced by 
negative ellipticities around 210 and 220 nm (Fig. 9B). The spectra 
are strongly characteristic of an a/ (3 protein, with spectral bands 
attributed to electron transitions in the amide groups of the protein 
backbone. Deconvolution of the spectrum using four different 
algorithms yielded the following average values for the secondary 
structure: 42 ±4% a-helix, 16 ±2 (3-sheet, and 42 ±5% coil (turn 
plus random). By comparison, the deconvolution results are in a 
good agreement with published report on the crystal structure of 
the TpManGH5 (41% a-helix, 12 (3-sheet, and 47% coil) [13]. 

As can be seen from Fig. 9, the CD spectrum for the TpMan 
does not show two minima well-defined and shows an increase in 
the negative ellipticities between 210 and 220 nm. These 
differences are suggestive of a greater (3-sheet content for TpMan 
compared with the TpMan GH5, and this difference can only be 
attributable to the TpManCBM27 plus TpManlg-like. This is 
reflected by an increase in (3-sheet content from 16 to 28% on 
deconvolution (Table 4). This was readily confirmed by analysis of 
the molecular models generated for the TpManCBM27 and 
TpManlg-like (Fig. 8). The TpManlg-like molecular model 
contains 40% |3-sheet and 34% coil, and the TpManCBM27 
molecular model contains 6% cx-helix, 60% (3-sheet, and 34% coil, 
consistent with the results of secondary-structure prediction 
(Table 4). Taken together, TpManGH5, TpManlg-like and 
TpManCBM27 models contain 25% a-helical, 29% (3-sheet, 
and 46% coil, entirely consistent with the results of deconvolution 
of the experimental spectrum (Table 4). 
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Figure 8. Threading methods and molecular modeling. The templates obtained by threading methods are carbohydrases from thermophilic 
bacteria. The central domain (TpManlg-like) was directly modeled from to XI domain from CbhA [23]. TpManCBM27 was directly modeled from to 
CBM27 from TmMan [22]. 
doi:1 0.1 371 /journal. pone.0092996.g008 



Kratky Plot Suggests that TpMan Presents Some Level of 
Flexibility at 20°C 

For a system with a compact shape, the Kratky plot shows a 
curve with a bell shape, where an initial rising portion is followed 
by a marked descent thereby forming a well-defined maximum 
[20]. Nevertheless, the expected curve for a polymer in an 
extended or random coil conformation rises to a characteristic 
plateau, with no well-defined maximum [20]. Molecules with 
compact regions connected by flexible linkers can have both 
characteristics, a clear maximum followed by plateau regions. The 
high of the plateau is an indication of the degree of flexibility of the 
molecule. Thus, one can use the behavior of the Kratky plot to 
qualitatively assess the degree of flexibility within the scattering 
molecule [20]. 

The Kratky plot in Fig. 7A shows that the curve for TpMan, at 
pH 6 and 20°C, shows a well-defined maximum (q <0.15 A % 
however, presents a slight elevated baseline at high q values (q > 
0.1 5A l ). This behavior at q >0.15A 1 suggests that TpMan 
presents some level of intrinsic molecular flexibility. Due to their 
architecture, modular proteins can often adopt several conforma- 
tions in solution. The ensemble optimization method (EOM) 
represents an excellent strategy to identify interdomain motions 
unambiguously [24]. Thus, to investigate the hypothesis that 
TpMan presents some level of flexible in solution at 20°C, we 
performed additional studies employing EOM modeling. The R g 
and D max of the structures selected during the ensemble 
optimization are centered around 31 A and 103 A, respectively 



(Fig. 10A). The results obtained using EOM (at pH 6 and 20°C) 
are consistent with a model in which TpMan adopts an ensemble 
of limited conformations represented by the cloud of models in 
Fig. 12. The Fig. 13 also shows the structure with highest 
frequency in the optimized population. 

Three-dimensional Model for the Full-Length TpMan at 

20°C 

Rigid-body modeling was performed to build a model of the 
full-length TpMan by fitting of the scattering curve. The three- 
dimensional structures of the TpMan GH5, TpManlg-like and 
TpManCBM27 were organized and their relative positions and 
orientations were optimized. The rigid-body adjustments of the 
three-dimensional structures resulted in an excellent fit to SAXS 
experimental data (Fig. 1 1A). Superposition of the low-resolution 
model obtained by SAXS and TpMan rigid-body model (RBM), 
both obtained at 20°C and pH 6, shows excellent agreement 
(Fig. 6A) and it is clear that SAXS model of TpMan is compatible 
with a monomer composed of three distinct domains. The rigid- 
body model built shows TpManGH5 and TpManCBM27 
occupying the opposite ends of the molecular envelope, while 
TpManlg-like occupies the central region. Finally, the superpo- 
sition of the EOM highest frequency model with the rigid-body 
model obtained using CORAL package indicates that the results 
are in excellent agreement (Fig. 13). This indicates that, even 
though the full enzyme has some degree of flexibility at 20 °G, 
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Figure 9. Circular dichroism spectroscopy. (A) CD spectrum 
collected for recombinant TpMan at 20°C (black solid line), 50°C (black 
dash line) and 85°C (gray solid line). (B) CD spectrum of TpManGH5 
(black solid line) and TpMan (gray solid line) at pH 6 and 20°C 
doi:1 0.1 371 /journal. pone.0092996.g009 

there might be a preferable conformation, which could be 
described by the rigid body modeling procedure. 

SAXS Reveals Temperature-dependent Conformational 
Changes in TpMan 

A better understanding of the biochemistry and biophysics of 
the processes of conformational change of TpMan as a function of 
temperature is very important to evaluate the relationship of 
enzyme structure, stability, flexibility, and enzymatic activity. 
Little is known about how molecular mobility of carbohydrate- 
binding domain with respect to catalytic domain correlates with 
the enzymatic activity. It seems logical to assume that changes in 
the interplay between the TpManCBM27 and the TpManGH5 
can influence the access of polysaccharides to the active site of the 
enzyme and thus has an impact on the enzymatic activity. In the 
case of CBM27 from TmMan, it was proposed that the hydrolysis 
of polysaccharides is enhanced through targeting of the substrate 
via this carbohydrate-binding domain [22]. 

The SAXS study presented here demonstrate that TpMan 
undergoes a temperature-driven transition between conformation- 
al states (Fig. 5 and 6) without a significant disruption of its 
secondary structure (Fig. 9), providing new insight into the 
function of the enzyme. The increase in average separation 
between the TpMan GH5 and TpManCBM27 as the temperature 
increase from 20 to 65 °C is relatively subtle, but can be seen in the 
data and models (Fig. 5, 6 and 10). This result indicate that the 
linker in the case of TpMan can optimize the geometry between 



the others two domains with respect to the substrate at high 
temperatures. 

Conclusions 

The study reported here establishes the first structural model of 
a three-domain hyperthermostable bacterial endo-(3-l,4-manna- 
nase in solution combining SAXS data analysis and modeling with 
the knowledge of the three-dimensional atomic resolution struc- 
tures of each individual domain. Our results shed light on the 
solution conformation of the entire TpMan enzyme as well as the 
effect of the length and flexibility of the linker on the spatial 
arrangement of the constitutive domains. It was demonstrated that 
the linker occupies a small volume with respect to its relative large 
number of amino acids. Furthermore, the results also indicate that 
the linker is a compact domain and structurally very similar to 
immunoglobulin-like (3-sandwich domain. At 20°C, even though 
the full enzyme has some degree of flexibility, there might be a 
preferable conformation, which could be described by the rigid 
body modeling procedure. Furthermore, TpMan undergoes a 
temperature-driven transition between conformational states 
without a significant disruption of its secondary structure. Finally, 
the EOM results indicate that TpMan becomes more elongated 
and less flexible when the temperature increases from 20 to 65 °C. 
Taken together, the results indicate that the linker in the case of 
TpMan can optimize the geometry between the others two 
domains with respect to the substrate at high temperatures. 

Based on these observations we advocate that the linker is very 
important to guarantee the optimal conditions for TpMan 
enzymatic activity due to at least three factors: i) the thermo- 
stabilization of the whole enzyme, ii) adequate degree of molecular 
flexibility between the structural domains and in) optimize the 
geometry between the others domains with respect to the substrate 
at high temperatures. These studies should provide a useful basis 
for future biophysical studies of entire TpMan and others 
multimodular carbohydrases. 

Materials and Methods 

Materials 

Nickel-nitrilotriacetic acid resin (Ni-NTA), imidazole, kanamy- 
cin, LB medium, isopropyl-(3-D-thiogalactopyranoside (IPTG) and 
locust bean gum were purchased from Sigma- Aldrich. All 
chemicals and reagents used in this study were of the highest 
purity analytical grade. 

Expression and Purification of Recombinant TpMan and 
TpManGH5 

The expression and purification of endo-(3-l,4-mannanase 
(TpMan) and its isolated catalytic core domain (TpManGH5) 
were carried out as described previously with minor modification 
[13], and the purity of the final product verified by SDS-PAGE. 
The resulting TpMan and TpManGH5 were exhaustively 
dialyzed in 20 raM acetate-borate-phosphate buffer adjusted at 
pH 6 for elimination of imidazole and NaCl. The concentrations 
of the recombinant proteins were determined by UV absorbance 
at X = 280 nm using a theoretical extinction coefficient based on 
the amino acid composition. The theoretical coefficients employed 
were S280 nm = 156,900 M 'cm 1 for TpMan and 
£280 nm= 108,875 M'Vrn" 1 for TpMan GH5. The final products 
TpMan and TpMan GH5 were then frozen and stored at — 80°C 
and were melted on ice before use. 
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Table 4. CD deconvolution and secondary-structure prediction. 





CD deconvolution* 


domains 


a-helix (%) 


p-sheet (%) 


coil (%) 


TpManGH5 


42±4 


16±2 


42±5 


TpMan 


21 ±4 


28±3 


51 ±5 


Three-dimentional structure** 


domains 


a-helix (%) 


P-sheet (%) 


coil (%) 


TpManGH5 


41 


12 


47 


TpManlg-like 


0 


40 


60 


TpManCBM27 


6 


60 


34 


TpMan 


25 


29 


46 


Secondary-structure prediction*** 


domains 


a-helix (%) 


P-sheet (%) 


coil (%) 


TpManGH5 


31-40 


11-17 


45-57 


TpManlg-like 


0-9 


42-51 


43-57 


TpManCBM27 


0-11 


36-60 


34-65 


TpMan 


18-26 


23-32 


39-59 



*The percentages of a-helix, p-strand, and coils were obtained by deconvolution of the experimental spectra using four independent methods and expressed as a mean 
and standard deviation. 

**The secondary structures were obtained from the three-dimensional structures of TpManGH5, TpManlg-like and TpManCBM27. 

***The computational methods BALDIG, PHD, PSIPRED, and PORTER are compared. The maximum and minimum values obtained for each type of secondary structure 
are given. 

doi:1 0.1 371/journal.pone.0092996.t004 



Enzymatic Activity Measurements 

Mannan endo-l,4-(3-mannosidase activity was determined using 
as substrate 1% against locust bean gum (galactomannan) 
dissolved in a buffer. Activity of TpMan and TpManGH5 were 
measured in the temperature range between 20 and 85°C (20 mM 
acetate-borate-phosphate buffer adjusted at pH 6). The reaction in 
each experiment was performed by mixing 20 uL of the diluted 
enzyme (1.5 (xM) with 80 |iL of galactomannan for 10 min. The 
reaction was stopped by the addition of DNS reagent followed by 
boiling the samples at 100°C water bath for 5 min [29]. One unit 
of mannan endo-l,4-(3-mannosidase activity was defined as the 
amount of enzyme needed to release 1 jimol of mannose 
equivalent per minute. All experiments were done in triplicate, 
and average values are reported. 

Dynamic Light Scattering (DLS) 

The size characteristic of the purified TpMan and TpManGH5 
samples were examined by means of the Nano-ZS dynamic light 
scattering system (Malvern Instruments Ltd, Malvern, UK). This 
system employs a 633 nm laser and a fixed scattering angle (173°). 
Protein solutions (between 0.5 and 8 mg/mL), in 20 mM acetate- 
borate-phosphate buffer adjusted at pH 6, were first passed 
through a 0.22 (im filter (Millipore, USA), centrifuged at 
1 6,000 xg for 10 min at room temperature, and subsequently 
loaded into a cuvette prior to measurement. The temperature was 
raised from 20 to 85°G and the samples were allowed to 
equilibrate for 2 min in each temperature prior to DLS 
measurements, after which multiple records of the DLS profile 
were collected. In each case the hydrodynamic radius was 
obtained from a second order cumulant fit to the intensity auto- 
correlation function (size distribution by volume). 



Small-Angle X-ray Scattering (SAXS) Data Collection 

For SAXS measurements, TpMan was measured at different 
protein concentrations (2, 4 and 8 mg/mL in 20 mM acetate- 
borate-phosphate buffer adjusted at pH 6) and temperatures 
(T = 20, 50 and 65°C). The samples were passed through a 
0.22 um filter (Millipore, USA) and centrifuged at 16,000 xg for 
10 min at room temperature prior to measurement. No concen- 
tration effects were detected for the samples. The measurements 
were carried out on a laboratory SAXS instrument Bruker- 
Nanostar, placed at the Institute of Physics of University of Sao 
Paulo. This equipment is improved by the use of microfocus 
source Genix3D coupled with Fox3D multilayer optics and two 
sets of scatter less slits for beam definition, all provided by Xenocs. 
The samples were kept on a quartz capillary glued to a stainless 
steel case, which enabled the proper rinse and reuse of the sample 
holder, permitting an accurate background subtraction. The 
scattering of water measured on the same sample holders was 
used to normalize the data to absolute scale. The sample 
temperature was controlled by a Peltier system. Several 900 s 
frames were recorded for each sample to monitor radiation 
damage and beam stability. The wavelength of the incoming 
monochromatic X-ray beam was X=\.54A (Cu koc ) and the 
sample to detector distance was 0.67 m, providing an q (scattering 
vector) interval from 0.008 to 0.35 A \ where q = 4 n sin(Q)/X 
and 9 is half the scattering angle. The 2D scattering data was 
collected on a Vantec2000 detector and the integration of the 
SAXS patterns were performed by the use of the Bruker SAXS 
software. The data treatment, normalization to absolute scale and 
averaging procedures were performed by the use of the SUPER- 
SAXS package [Oliveira and Pedersen, unpublished]. 
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Figure 10. EOM modeling. (A) The dash lines are the distributions 
{R g and D max ) produced by the software from the SAXS data collected at 
20°C, whereas the solid lines are the final distributions of structural 
parameters from the models selected by the ensemble optimization 
method [24]. (B) The dash lines are the distributions {R g and D max ) 
produced by the software from the SAXS data collected at 50°C, 
whereas the solid lines are the final distributions of structural 
parameters from the models selected by the EOM [24]. (C) The dash 
lines are the distributions {R g and D max ) produced by the software from 
the SAXS data collected at 65°C, whereas the solid lines are the final 
distributions of structural parameters from the models selected by the 
EOM. 

doi:1 0.1 371 /journal, pone.0092996.g010 
SAXS Data Analysis 

The radius of gyration (R g ) of the molecules were determined by 
two independent procedures: i) by the Guinier equation 
I{q) = I{0).exp[{-q 2 .R g 2 )/3], q<l.3/R g , and ii) by the indirect 
Fourier transform method using the GNOM package (www. 
embl-hamburg.de/biosaxs) [17]. The distance distribution func- 
tion P[r) was also evaluated with GNOM software and the 
maximum diameter (D max ) was obtained. 

SAXS ab initio Modeling 

Dummy atom models (DAMs) were calculated from the 
experimental SAXS by ab initio procedures implemented in 
DAMMIN package (www.embl-hamburg.de/biosaxs) [18]. The 
low-resolution models obtained, in each case, were compared with 
each other by the use of the DAMAVER [19] procedure and the 
most representative model for the whole set was used. The 
resolution (R) was estimated from the equation R = 2n/ q max . The 
GRYSOL package (www.embl-hamburg.de/biosaxs) was used to 
generate theoretical scattering curves from DAMs [30]. R g and 
D max were determined with the same package. 



Figure 11. RBM and EOM fits. (A) Experimental SAXS curves of the 
TpMan at 20°C (open black circles with errors bars), 50°C (open black 
triangles with errors bars) and 65°C (open black squares with errors 
bars) superimposed on the computed scattering curves based on the 
rigid-body models (RBMs, solid black lines). (B) Fits (solid black lines) of 
the best profiles determined by EOM modeling to the SAXS data 
collected at pH 6.0 and 20°C (open black circles with errors bars), 50°C 
(open black triangles with errors bars) and 65°C (open black squares 
with errors bars). 

doi:10.1371/journal.pone.0092996.g011 
Rigid-Body Modeling and EOM Modeling 

Rigid-body modeling of the TpMan was performed with the 
CORAL package [25]. CRYSOL was used to generate the 
simulated scattering curves from the rigid-body model. The rigid- 
body model and ab initio DAMs were superimposed with de 
SUPCOMB package [31]. Additional structural modeling em- 
ployed the ensemble optimization modeling method (EOM) 
described by Bernado et al. [24] . Superposition figures were 
generated by the PyMOL program (www.pymol.org). 

Molecular Modeling 

The remote homology detection servers HHPRED [32] and 
PHYRE2 [33] were used to search for homologs of the last 274 
amino acids residues from TpMan in the Protein Data Bank (PDB) 
with default parameters. Molecular models for TpManCBM27 
and TpManlg-like were built using restraint-based homology 
modeling, as implemented in the program Modeller software [34] . 
The alignment of the TpManCBM27 against the sequence of 
CBM27 from Thermotoga maritima endo-(3-l,4-mannanase was 
initially used as input to the Modeller program, together with 
the atomic coordinates of the latter (PDB 10H4). The alignment 
of the TpManlg-like against the sequence of XI domain from 
Clostridium thermocellum cellobiohydrolase A was initially used as 
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Figure 12. EOM model. Selected model (cartoon) having the highest 
frequency in the optimized population superposed to the cloud of 
models. The structure shown is rotated 90°about the long axis of the 
structure to produce the two views. 
doi:1 0.1 371 /journal. pone.0092996.g01 2 

input to the Modeller program, together with the atomic 
coordinates of the latter (PDB 3PE9). 

Computational Methods for Secondary-structure 
Prediction 

The theoretical extinction coefficients, based on the amino acid 
composition (Gene Bank: ABQ47550.1), for TpMan and 
TpManGH5 were obtained from the ProtParam utility available 
on the ExPaSy server (www.expasy.org). The methods used for 
general secondary-structure prediction were PHD [35] , PSIPRED 
[36], BALDIG [37], and PORTER [38]. 

Far-UV Circular Dichroism (CD) Spectroscopy 

Far-UV CD spectra were collected using a JascoJ-815 
spectropolarimeter equipped with a temperature control device. 
TpMan and TpManGH5 concentrations were 5 uM in 20 raM 
acetate-borate-phosphate buffer adjusted at pH 6. All data were 
collected using 0.1 cm quartz cuvette and the spectra were 
recorded over the wavelength range from 195 to 250 nm. In the 
case of TpMan, CD measurements were collected at different 
temperature values (T = 20, 65 and 85°C). Eight accumulations 
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were averaged to form the CD spectra, taken using a scanning 
speed of 100 nm min \ a spectral bandwidth of 1 nm, and a 
response time of 0.5 s, and obtained on degree scale. The buffer 
contribution was subtracted in each of the experiments. Spectra 
were transformed to molar ellipticity hello 0] using the mean weight 
residue and concentration prior to the secondary-structure 
analysis. To obtain structural information, CD spectra were 
deconvoluted using the SELCON3 [39], CONTIN [40], CDS 
[41], and K2D3 [42] programs using different databases. 

Supporting Information 

Figure SI DLS temperature experiments with TpMan. 

(A) The size distribution by intensity for purified TpMan (0.5 mg/ 
mL) where DLS runs were conducted between 20-85 °C. (B) The 
size distribution by volume for purified TpMan where DLS runs 
were conducted at 20, 30, 40, 50 and 85°C. 
(TIF) 

Figure S2 DLS temperature experiments with 
TpManGH5. (A) The size distribution by intensity for purified 
TpManGH5 (0.5 mg/mL) where DLS runs were conducted 
between 20-7 0°C. (B) The size distribution by volume for purified 
TpManGH5 where DLS runs were conducted at 20, 30, 40, 50 
and 70°C. 
(TIF) 
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